Miroslav Pǐstěk Approximate Dynamic Programming based on High Dimensional Model Representation 2310 October 2011
نویسنده
چکیده
In this article, an efficient algorithm for an optimal decision strategy approximation is introduced. The proposed approximation of the Bellman equation is based on HDMR technique. This non-parametric function approximation is used not only to reduce memory demands necessary to store Bellman function, but also to allow its fast approximate minimization. On that account, a clear connection between HDMR minimization and discrete optimization is newly established. In each time step of the backward evaluation of the Bellman function, we relax the parameterized discrete minimization subproblem to obtain parameterized trust region problem. We observe that the involved matrix is the same for all parameters owning to the structure of HDMR approximation. We find eigenvalue decomposition of this matrix to solve all trust region problems effectively. The achieved estimates of minima are immediately stored in HDMR approximation to avoid a full-domain representation of Bellman function. We assume that the newly developed approximate minimzation of HDMR can be beneficial also in other applications.
منابع مشابه
Approximate dynamic programming based on high dimensional model representation
This article introduces an algorithm for implicit High Dimensional Model Representation (HDMR) of the Bellman equation. This approximation technique reduces memory demands of the algorithm considerably. Moreover, we show that HDMR enables fast approximate minimization which is essential for evaluation of the Bellman function. In each time step, the problem of parametrized HDMR minimization is r...
متن کاملHigh dimensional model representation
Abstrakt: In practical applications of the control theory, there is lack of approximation techniques applicable to intractable dynamic-programming equations describing the optimality controller. In the paper, we consider use of the technique coming from chemistry and called high dimensional model representation (HDMR). Its main advantages are finite order of expansion and rapid convergence for ...
متن کاملOPTIMIZATION OF A PRODUCTION LOT SIZING PROBLEM WITH QUANTITY DISCOUNT
Dynamic lot sizing problem is one of the significant problem in industrial units and it has been considered by many researchers. Considering the quantity discount in purchasing cost is one of the important and practical assumptions in the field of inventory control models and it has been less focused in terms of stochastic version of dynamic lot sizing problem. In this paper, stochastic dyn...
متن کاملThree dimensional static and dynamic analysis of thick plates by the meshless local Petrov-Galerkin (MLPG) method under different loading conditions
In this paper, three dimensional (3D) static and dynamic analysis of thick plates based on the Meshless Local Petrov-Galerkin (MLPG) is presented. Using the kinematics of a three-dimensional continuum, the local weak form of the equilibrium equations is derived. A weak formulation for the set of governing equations is transformed into local integral equations on local sub-domains by using a uni...
متن کاملApproximate Incremental Dynamic Analysis Using Reduction of Ground Motion Records
Incremental dynamic analysis (IDA) requires the analysis of the non-linear response history of a structure for an ensemble of ground motions, each scaled to multiple levels of intensity and selected to cover the entire range of structural response. Recognizing that IDA of practical structures is computationally demanding, an approximate procedure based on the reduction of the number of ground m...
متن کامل